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Abstract 

Consider several source nodes communicating across a wireless network to a destination node with the help of 
several layers of relay nodes. Recent work by Avestimehr et al. has approximated the capacity of this network up 
to an additive gap. The communication scheme achieving this capacity approximation is based on compress-and- 
forward, resulting in noise accumulation as the messages traverse the network. As a consequence, the approximation 
gap increases linearly with the network depth. 

This paper derives a capacity approximation for multi-layer wireless relay networks with an approximation 
gap that is independent of the network depth and dependent only on the number of source nodes and the fading 
statistics. This is achieved by a new communication strategy termed computation alignment. This strategy is 
based on the compute-and-forward framework, which enables relays to decode deterministic functions of the 
fT) | transmitted messages. Alone, compute-and-forward is insufficient to approach the capacity as it incurs a penalty for 

approximating the wireless channel with complex-valued coefficients by a channel with integer coefficients. Here, 
this penalty is avoided by breaking the wireless channel into several subchannels combined with a signal-alignment 
strategy to ensure that these subchannels have integer channel coefficients. 
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I. Introduction 



Consider a line network, consisting of a single source communicating to a single destination via a 
(Nj | sequence of relays connected by point-to-point channels. The capacity of this simple relay network is 
• achieved by decode-and-forward and is determined solely by the weakest of the point-to-point channels. 
, As a consequence, the performance of the optimal scheme is unaffected by noise accumulation, regardless 
q£ \ of the length of the relay network. This raises the question whether the same holds true in general multi-user 
O wireless relay networks, i.e., if the capacity depends on the network depth. In this paper, we investigate this 
question in the context of multiple sources communicating with a single destination across a multi-layer 
• • ! wireless relay network. 



A. Motivation and Summary of Results 

In a multi-layer wireless relay network, each relay observes a noisy linear combination of the signals 
transmitted by the relays in the previous layer. In order to avoid noise accumulation, the relays should 
perform some type of decoding to eliminate noise at each layer. One natural approach, decode-and-forward, 
is for each layer of relays to decode the messages sent by the previous layer and retransmit them, just as in 
the line network mentioned above. Unfortunately, while the performance of this scheme is independent of 
the network depth, it is often interference limited and, as a result, its performance can diverge significantly 
from the capacity. 

Instead of combating interference, as is done in the decode-and-forward approach, other communica- 
tion strategies embrace the signal interactions introduced by the wireless channel. One such strategy is 
compress-and-forward, in which each relay transmits a compressed version of its received signal. Such 
strategies can offer significant advantages over decode-and-forward. Indeed, recent work by Avestimehr 
et al. [1] has shown that, for the class of multi-layer wireless relay networks with single destination 
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considered here, compress-and-forward approximately achieves capacity up to a gap independent of the 
power constraints at the nodes in the network. However, since the compress-and-forward scheme does not 
remove noise at each relay, noise accumulates from one layer in the network to the next. As a consequence, 
the approximation gap in [1] (and related ones such as those based on noisy network coding [2]) increases 
linearly with the number of layers in the relay network. Thus, as the depth of the network increases, the 
approximation guarantee becomes weaker. 

In this paper, we make progress on this issue by deriving a new capacity approximation result for the 
multi-layer relay network with an approximation gap dependent only on the number of source nodes in 
the network and on the fading statistics, but independent of the depth of the network. In other words, 
unlike the approximation result in (T), the approximation guarantee here is not affected by increasing the 
depth of the relay network. 

Our approach is built around the compute-and-forward framework proposed by [3]. In this framework, 
each transmitter encodes its message into a codeword drawn from the same lattice codebook. As a result, all 
integer combinations of codewords are themselves codewords, enabling relays to decode linear functions 
of the transmitted codewords rather than treating interference as noise. While the use of lattice codes 
seems like a natural fit for this setting, it alone is insufficient to approach the network capacity, as was 
shown recently in Q. The primary reason is that this scheme approximates the wireless channel with 
complex- valued channel gains by a channel with integer- valued channel gains. The residual signals not 
captured by this integer approximation are treated as additional noise. It is this additional noise that 
ultimately limits the performance of this scheme in the high signal-to-noise ratio (SNR) regime. This 
obstacle was overcome in [4] in the high SNR limit by combining compute-and-forward with the rational 
alignment scheme due to Motahari et al. 0. For the time- varying channels considered here, we propose 
a new scheme, termed computation alignment, that allows for a much sharper analysis at finite SNRs. Our 
scheme couples compute-and-forward with a signal-alignment scheme inspired by ergodic interference 
alignment [6|. This approach decomposes the wireless channel with time- varying complex-valued channel 
gains into several subchannels with constant integer- valued channel gains, over which lattice codes can 
be employed efficiently. 

B. Related Work 

Relay networks have been the subject of considerable interest. For wired networks (i.e., networks of 
point-to-point channels), Koetter et al. recently proved that it is capacity-optimal to separate channel and 
network coding Q. It is now well known that routing over the resulting graph of bit pipes is optimal for 
unicasting (8]|, BUl and, as demonstrated by Ahlswede et al. IfTOl . network coding is required to achieve 
the multicast capacity. 

For wireless networks, channel-network separation is not always optimal: higher rates can be achieved 
using more sophisticated relaying techniques such as compress-and-forward (see, e.g., HI, GJ, lfTTll - |fT3~1 ), 
amplify-and-forward (see, e.g., lfl4H~lfT8ln. and compute-and-forward (see, e.g., 0, Hi, OH, ||2D||). While 
for certain classes of deterministic networks the unicast and multicast capacity regions are known flU, 
ETI . Il22l . in the general, noisy case, these problems remain open. Recent progress has been made by 
focusing on finding capacity approximations flU, Il23l - [|26ll . 

As mentioned above, our approach combines signal alignment with lattice coding techniques. Signal 
alignment for interference management has proved useful especially for the Gaussian interference channel 
ED, flUL E4l . Il27ll - ll29ll . Recent work has developed alignment schemes that attain the maximum degrees 
of freedom of certain Gaussian multi-hop networks 113011 - 113311 . 

Lattice codes provide an elegant framework for many classical Gaussian multi-terminal problems [1341 , 
11331 . Beyond this role, it has recently been shown that they have a central part to play in approaching the 
capacity of networks that include some form of interference OH. 1191. EDI. 1231. 1291, |[3oTl, ||57|. 



3 



C. Organization 

The remainder of this paper is organized as follows. Section [II] introduces the problem setting as well 
as notation. Section [TIT] presents the main results. Sections HVT - IVIIII contain proofs. Section ITXl concludes 
the paper. 

II. Problem Setting and Notation 

This section formally introduces the problem setting and notation. Although we are interested here in 
relay networks with several layers, it will be convenient to first discuss networks with a single layer. This 
single-layer network model is presented in Section ITl-BI We then apply the insights obtained for networks 
with a single layer of relays to networks with more than one layer of relays. This multi-layer network 
model is presented in Section ITl-Cl Before we formally describe these two problem settings, we introduce 
some notational conventions in Section III-AI 

A. Notational Conventions 

Throughout this paper, log denotes the logarithm to the base two, and all capacities and rates are hence 
expressed in terms of bits. We use bold font lower and upper case, such as h and H, to denote vectors 
and matrices, respectively. Realizations of random variables will be denoted by sans-serif font, e.g., H is 
a realization of the random matrix variable H. 



B. Single-Layer Relay Networks 

We start with a model for a wireless relay network with a single layer. This single layer is to be 
interpreted as a part of a larger relay network, to be introduced formally in Section III-C1 The single- 
layer relay network consists of K transmitters and K receivers as depicted in Fig. Q] We think of the K 
transmitters as being located at either the source nodes or at the relay nodes in some layer, say d, of the 
larger relay network. We think of the K receivers as being located at either the destination nodes or at 
the relay nodes at layer d + 1 of the larger relay network. 
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Fig. 1. K transmitters communicate an invertible set of functions uu = fk(wi,W2, ■ ■ ■ ,Wk) of their messages to K receivers over a 
time-varying interference channel. 



Each transmitter, indexed by k G {1, ... , K}, has access to a message Wf. that is generated independently 
and uniformly over {1, . . . , 2 TRk }, where Rk is the rate of transmitter k. Each receiver, indexed by 
m G {1, . . . , K}, aims to recover a deterministic function 

of the K messages (wi, . . . , Wk)- We impose that the functions (f m )„\ =1 computed at the receivers are 
invertible. In other words, there must exist a function g such that g(u\, u 2 , ■ ■ ■ , %) = (w 1 , w 2 , ■ ■ ■ , wk). 
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Since the functions to be computed at the receivers are deterministic, noise is prevented from accumulating 
as messages traverse the larger relay network. Moreover, since the functions to be computed are invertible, 
no information is lost from one layer to the next in the larger relay network. 

The transmitters communicate with the receivers over a Rayleigh-fading complex Gaussian channel 
modeled as follows. The channel output y m [t\ G C at receiver m G {1, . . . , K} and time t G IN is given 
by 

K 

Vm [t] =^2h m:k [t]x k [t] +Zm[t], ( 1 ) 

k=l 

where x k [t] G C is the channel input at transmitter k, h myk \t\ is the channel gain between transmitter k 
and receiver m, and z m [t] is additive receiver noise, all at time t. The noise z m [t] is circularly-symmetric 
complex Gaussian with mean zero and variance one, and independent of the channel inputs x k [t] for 
k G {\,...,K}, t G IN, and independent of all other z m >[t'] for {ml \t') ^ (m,t). Each channel gain 
h m ,k[t\ is assumed to be circularly-symmetric complex Gaussian, with mean zero and variance one, i.e., 
we assume Rayleigh fading. As a function of t, (h m! k[t])teN is a stationary ergodic process for every m 
and k. The K 2 processes (h mj k[t])teN are mutually independent as a function of m,k. Denoting by 

H[t] = (h mt k[t]) m> k 

the matrix of channel gains at time t, this implies that the matrix process 

H[1],H[2],H[3],... 

is also stationary and ergodic. The channel gains H[t] are known at all nodes in the network at time 
t. In other words, we assume availability of full causal channel state information (CSI) throughout the 
network. 

Each transmitter consists of an encoder S k mapping its message w k into a sequence of T channel inputs 

(x k [t])J =1 = £ k (w k ), 

satisfying an average power constraint 

1 T 

^5> fc [*]| 2 <P. 
t=i 

Each receiver consists of a decoder V m mapping its observed channel output into an estimate 

u m = T> m (y m [l], . . .,y m [T\) 
of the desired function u m = f m (w 1: . . . , w K ). Let 

denote the average probability of error across all relays. 

Definition. A computation sum rate R(P) is achievable if, for every e > and every T > T (e), there 
exist encoders with blocklength T, average power constraint P, and rates satisfying J2k=i^-k — -^(-P)> 
and there exist decoders computing some invertible deterministic function (/ m )^ =1 with probability of 
error p e < e. The computation sum capacity C(P) of the single-layer relay network is the supremum of 
all achievable computation sum rates R(P). 

Observe that the definition of computation sum capacity does not prescribe the function of the messages 
to be computed at the receivers. The only requirement is that these functions are deterministic and 
invertible. In other words, the computation sum capacity is the largest sum rate at which some (as opposed 
to any) such function can be computed reliably. 
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C. Multi-Layer Relay Networks 

Having described the single-layer network setting, we now turn attention to networks with multiple 
layers of relays. These networks consists of a concatenation of D single-layer networks as defined in 
Section Ill-BI The network contains K source nodes at layer zero connected through a Rayleigh-fading 
channel to K relay nodes at layer one. Layer d in the network contains K relay nodes connected through 
a Rayleigh-fading channel to K relay nodes at layer d + 1. The relay nodes at layer D are connected to 
the destination node at layer D + l through orthogonal bit pipes of infinite capacity. This ensures that the 
intermediate relay layers, not the bit pipes, are the bottleneck in the network. This scenario is depicted 
in Fig. [21 
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Fig. 2. A multi-layer relay network with D = 2 layers and K relays per layer. 



Formally, each transmitter at a source node, indexed by k & {1, ... , K], has access to a message Wk of 
rate Rj- that is generated independently and uniformly over {1, . . . , 2 TRk }. The receiver at the destination 
node aims to recover the transmitted messages (wi, w 2 , ■ ■ ■ , wk)- 

The transmitters at layer d E {0, . . . , D — 1} communicate with the receivers at layer d + l over 
a Rayleigh-fading complex Gaussian channel modeled as in the single-layer case. The channel output 
Dm [t] G C at the receiver at relay m e {1, . . . ,K} in layer d + l and time t e IN is given by 

fc=i 

where x^ft] is the channel input at the transmitter at relay or source G {1, . . . , K} at layer d. The 
channel gains h^ + ,}\t\ and the additive noise Zm +1 ^ satisfy the same statistical assumptions as in the 
single-layer network, see CQ), and are assumed to be independent across different layers. As mentioned 
earlier, the receiver at the relay nodes in layer D are connected to the destination node at layer D + l 
through K orthogonal bit pipes with infinite capacity. Without loss of generality, we can assume that the 
relays in layer D simply forward their observed channel outputs yffl [t] to the destination node. 

Each transmitter at source node k consists of an encoder £ k mapping its message Wk into a sequence 
of T channel inputs, 

(4 0) M)f=i = ^K), 

satisfying an average power constraint 

t=i 

The receiver-transmitter pair at relay node k in layer d G {1, . . . , D — 1} consists of a relaying function 
J 7 ^ mapping the block of observed channel outputs (y^ [1], • • • , Vm [T]) from layer d into a block of 
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channel inputs 

for layer d + 1, satisfying an average power constraint 

As may be seen from the definition of the relaying function, we do not impose causality for the operations 
at the relay. This assumption is only for ease of notation — since we are dealing with a layered network, 
all results are also valid for causal relaying functions by coding over several blocks. 

Finally, the receiver at the destination node in layer D + l consists of a decoder V mapping its observed 
channel outputs (forwarded from the relays at layer D) into an estimate 

(w u w 2 , ...,w K )± V((y[ D) [l], . . .,y[ D) [T}),. . . , (yg»[l], . . .,y^[T])) 

of the messages (wi, . . . , Let 

Pe = P(Uf=l{^ ± W k}) 

denote the average probability of decoding error. 

Definition. A sum rate RS D > (P) is achievable if, for every e > and every T > T (e), there exist encoders, 
relaying functions, and a decoder with blocklength T, average power constraint P, rates satisfying 
J2k=i R k > P (D) (P), and probability of error p e < e. The sum capacity C^ D \P) of the multi-layer 
relay network is the supremum of all achievable sum rates PS D \P). 

It is worth pointing out that C^ D \P) with D = 1 as defined here and C(P) defined in Section ITl-BI are, 
in general, not equal. In fact, C^(P) is the rate achievable over a single-layer network with a centralized 
decoder (having access to all the channel outputs at the K relays) at the destination node aiming to recover 
all K transmitted messages. On the other hand, C(P) is the rate achievable over a single-layer network 
with K decentralized decoders at the relays aiming to recover an invertible function of the K transmitted 
messages. Clearly, any such decentralized set of decoders can be used to implement a centralized decoder 
by concatenating them with an inverter. Hence C^-'(P) > C(P). However, we do not have equality in 
general. 

III. Main Results 

As the main results of this paper, we provide an approximate characterization of the computation sum 
capacity C (P) of the single-layer relay network (Section IIII-AI) . and an approximate characterization of 
the sum capacity C <kD \P) of the P-layer relay network (Section [III-BI) . 

A. Single-Layer Relay Networks 

We start with the analysis of the computation sum capacity C(P) of a single-layer relay network 
consisting of K source nodes and K relay nodes. 

Theorem 1. For a time-varying single-layer network with K > 2 source nodes and K relay nodes, the 
computation sum capacity C(P) is lower and upper bounded as 

IHog(P) - 7K 3 < C{P) < iHog(P) + 5K\og(K) 

for every power constraint P > 1. 

The proof of the lower bound in Theorem[T]is presented in Sections |V] (for K = 2) and|Vj](for K > 2). 
The proof of the upper bound in Theorem \T\ is presented in Section IVIII 
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Theorem Q] provides an approximate characterization of the computation sum capacity C(P) of the 
single-layer relay network. Comparing the upper and lower bounds shows that the approximation is up 
to an additive gap of 7K 3 + BKlog(K) bits/s/Hz. In particular, the gap does not depend on the power 
constraint P. In other words, Theorem Q] asserts that 

C(P) = K\og(P)±0(l). 

This is considerably stronger than the best previously known bounds in [4] on the computation sum 
capacity of such networks, which only provide the degrees-of-freedom approximation 

c(p) = inog(p)±o(iog(p)) 

as P — > oo. 

The upper bound in Theorem \T\ results from the cut-set bound, allowing cooperation among the sources 
and among the relays. This transforms the channel into a K x K multiple-input multiple-output system, 
and the upper bound follows from analyzing its capacity. From Theorem [H we hence see that computation 
of a (carefully chosen) invertible function can be performed in a distributed manner with at most a 0(1) 
loss in rate compared to the centralized scheme in which the K transmitters cooperate and the K receivers 
cooperate. 

The communication scheme achieving the lower bound in Theorem \T\ is based on a combination of a 
lattice computation code with a signal- alignment strategy, which we term computation alignment. We now 
provide a brief description of these two components and how they interact — the details of the argument 
can be found in the proof of Theorem \T\ in Sections |V] and [Vj] 

A lattice is a discrete subgroup of R T , and hence has the property that any integer linear combination 
of lattice points is again a lattice point. A lattice computation code as defined in uses such a lattice, 
intersected with an appropriate bounding region to satisfy the power constraint, as its codebook. Assume 
for the moment that the channel gains H are all integers. Then the receivers observe an integer linear 
combination of codewords plus Gaussian noise. By the lattice property, this is equal to some other 
codeword plus noise. If the lattice is carefully chosen, the receivers can remove the noise, and are hence left 
with the integer combination of the codewords — thus decoding a deterministic function of the messages. 

While this approach is close to optimal for channels with integer channel gains H, it was shown in 
AH that this approach performs quite poorly for general real or complex channel gains in the high SNR 
regime. The reason for the poor asymptotic performance of the lattice scheme is that only integer linear 
combinations of lattice codewords result in a codeword. For general real or complex channel gains, the 
linear combinations of the codewords computed by the channel do not result in another codeword. The 
residual interference resulting from this effect ultimately limits the performance of lattice computation 
codes as the signal-to-noise ratio increases. 

To circumvent this problem, we combine the lattice computation codes with a signal-alignment strategy. 
This signal- alignment strategy converts the channel with general complex channel gains into one with 
integer channel gains. Over this aligned integer channel, lattice computation codes can then be effectively 
used. Our signal- alignment strategy builds on the construction for interference alignment in [6] and [|28l . 

The computation- alignment scheme is best illustrated in the K = 2 user case. Consider two time slots 
t\ and t 2 . It is easy to see that the two transmitters can reliably communicate two data streams over these 
two time slots, for example using time sharing. This time-sharing approach can be expressed formally by 
choosing transmit vectors v^i G C 2 at transmitter k G {1,2} with v lt i = (1,0) and v 2y \ = (0,1). The 
corresponding data stream is multiplied by this transmit vector and sent over the channel at times t\ and 
t 2 . 

We now argue that, over two time slots, we can in fact reliably compute invertible functions of three 
data streams. To this end, we need to carefully match pairs of channel gains. Fix a time slot ti, and 
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assume we can find a second time slot U such that 



hi,i[t 2 ] 


= hx,i [h], 


hi j2 [t 2 ) 


= h i2 [h], 


h 2j2 [t 2 ) 


= h 2 ,2[ti), 




= -h 2 ,i[ti 



If we choose transmit vectors 



Vi 2 



at transmitter two, and transmit vector 



V2,l 



(1, 1), 

fcl,l[*l]fr2,2N 
^l,2[tl]^2,l[*l] 



x -1: 



1, 1 



at transmitter one, then the resulting received vectors are either equal (and hence aligned) or orthogonal, 
as depicted in Fig. [3] Using matched filters at the receiver, we can hence separate the channel output 
into two data streams. By construction, each of these data streams corresponds effectively to an additive 
Gaussian channel with integer channel coefficients, over which lattice computation codes can be used. 



Transmitter 1 



Receiver 1 





"1,1 = -Di i2 «2,l 



. £>i,it>i,a 



Transmitter 2 



"2,1 



Receiver 2 




/ D 2 il)l 2 = £>2,2"2,1 



Fig. 3. Signal alignment at the receivers. Transmitter one has two transmit vectors Ui, 1,171,2, and transmitter two has one transmit 
vector i>2,i- At receiver one, vi t i and i>2,i are aligned. At receiver two, ui, 2 and w 2 ,i are aligned. The figure uses the notation D m> k = 
diag(ft m ,fc[ti], h m ,k[t2\) to express the action of the channel on the channel input over the two time slots t\ and <2. 

Of course, matching time slots t% and t 2 such that the above conditions are satisfied exactly is not 
possible with probability one. We will instead use a quantization argument to show that this matching is 
possible to an arbitrary degree of accuracy. This shows that, over two time slots, we can reliably compute 
invertible functions of three data streams, achieving a computation sum rate of approximately |log(P). 
To achieve a computation rate of approximately 21og(P), we consider L time slots, and show that we 
can reliably compute invertible functions of 2L — 1 data streams over it. This results in a computation 
sum rate of approximately ^p^log(P), which converges to the desired 21og(P) as L — > oo. The gist of 
the proof is similar for K > 2, but the arguments are more involved. 
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B. Multi-Layer Relay Networks 

Having analyzed the computation sum capacity for single-layer relay networks, we now turn to the 
sum capacity of relay networks with multiple layers. Unlike the single-layer network, there is only one 
destination node, which is interested in recovering the original messages (and not merely a function of 
them). We are hence interested here in sum capacity in the traditional sense. 

Theorem 2. For a time-varying relay network with D > 1 layers, K > 2 source nodes, and K relay 
nodes in each layer, the sum capacity (P) is lower and upper bounded as 

Klog(P) - 7K 3 < C iD) (P) < K\og(P) + 5Klog(K) 

for every power constraint P > 1. 

The proof of Theorem [2] is presented in Section IVIII1 The upper bound follows directly from the same 
cut-set bound argument as in Theorem [TJ The lower bound uses compute-and-forward in each layer as 
analyzed in Theorem [TJ The destination node gathers all the computed functions and inverts them to 
recover the original messages sent by the source nodes. 

Theorem [2] provides an approximate characterization of the sum capacity of the D-layer relay network. 
The gap between the lower and upper bounds is 7K 3 + 5Klog(K) bits/s/Hz as in Theorem [TJ This gap 
is again independent of the power constraint P, showing that 

C (D) (P) = K\og(P) ±0(1). 

Moreover, the gap in Theorem [2] is also independent of the network depth D. In other words, the 
approximation guarantee is uniform in the network parameter D. 

It is interesting to compare this approximation result to other known capacity approximations for general 
Gaussian relay networks of the form considered here. For general relay networks, these bounds rely on 
a compress-and-forward scheme, and achieve an additive approximation gap of A(D + \)K bits/s/Hz [Q]|, 
11581 . Unlike the gap in Theorem [2l this gap is not uniform in the network depth D. This is due to 
the use of compress-and-forward: In each relay layer, the channel output, consisting of useful signal as 
well as additive noise, is quantized and forwarded to the next layer. Thus, with each layer additional 
noise accumulates, degrading performance as the network depth increases. The result is an approximation 
guarantee that becomes worse with increasing network depth. 

Theorem [2] in this paper avoids this difficulty by completely removing channel noise at each layer in 
the network. This is achieved by decoding a deterministic (and hence noiseless) function of the messages 
at each relay. Thus, noise is prevented from accumulating as the messages traverse the network. It is this 
feature of compute-and-forward that enables the uniform approximation guarantee in Theorem [2] 

We remark that the 7K 3 term in the lower bound of Theorem [2] is due to the construction ensuring that 
all received signals are integer multiples of each other. If instead of Rayleigh fading we consider channel 
gains with equal magnitude and independent uniform phase fading, the lower bound in Theorem [2] can 
be sharpened to Klog(P). Deriving capacity approximations with better dependence on K for general 
fading processes is an interesting direction for future work. 

It is also worth mentioning that, unlike the gap presented here, the approximation gap in [[TJ is uniform 
in the fading statistics. Developing communication schemes that guarantee an approximation gap that is 
uniform in both the network depth and the fading statistics is therefore of interest. 

Finally, like other signal alignment schemes for time- varying channels such as ||28l and flU, the 
communication scheme proposed in this paper suffers from long delays. In particular, the delay required 
to establish signal alignment scales at least as 2 K in the number of users K. This limits the practicality 
of these schemes even for moderate values of K. Finding ways to achieve signal alignment (be it for 
interference management or function computation) with less delay is hence of importance. 
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IV. Channel Quantization 

The achievable scheme in Theorem[T]groups together time slots so that an appropriate linear combination 
of the channel outputs within each group yields a more desirable effective channel. This grouping of time 
slots is performed such that the corresponding channel realizations "match" in a sense to be made precise 
later. Since each possible channel realization has measure zero, we cannot hope for channel matrices to 
match exactly. Instead, we will look for channel matrices that approximately match. This approximate 
matching is described by considering a quantized version of the channel gains. In this section, we describe 
such a quantization scheme, similar to the one used for ergodic interference alignment in J6]|. 

First, we divide the complex plane from the origin up to distance v into concentric rings centered at 
the origin and with spacing 1/u for some natural number v > 2 to be chosen later. Then, we divide each 
of these v 2 rings into v 2 L segments with identical central angles of size 2tt/(u 2 L) for some L E IN" also 
to be chosen later. These segments serve as quantization cells for the channel coefficients. Each segment 
is represented by the mid-point on the bisector of the corresponding central angle (see Fig. |U). We add 
one additional quantization point at infinity to which all channel gains with magnitude larger than v are 
mapped to. We will use frequently in the following that multiplying a quantization point by any Lth root 
of unity results again in a quantization point. 




Fig. 4. Quantization scheme for channel coefficients. Coefficients up to magnitude v are quantized by magnitude and angle. The number 
of angular regions is a multiple of L to ensure that multiplying a quantization point by any Lth root of unity results again in a quantization 
point. In the figure, v — 2 and L — 2 

Let h mt k[t] denote the quantized version of the channel coefficient h m> k[t\ G C. We then have that 
h m ,k[t] = oo if |/i m| fc[t]| > v, and that h nhk [t] is the point in the "middle" of the quantization cell 
containing h m ^[t] otherwise (with ties broken arbitrarily). We denote by H the collection of all possible 
quantized channel values. It will be convenient in the following to denote by 

p A (H)4p(H[l] = H) 
the probability mass function of the quantized channel gains 

H[t) = (h m ,k[t))m,k- 

Note that the number of quantization regions is 

\U\ = v*L+l. (2) 
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By choosing v large enough, we can ensure that the distance between any point with magnitude less than 
v and its closest quantization point is arbitrarily small. In fact, for any h m:k [t] with |/i mi fc[t]| < u, 

\hm ik [t] - h m ,k[t]\ <{ir + l)/v. (3) 

Furthermore, for any 5 > 0, 

P(|/im,fcWI <^ Vm,fc) >l-6 
for large enough v by bounded convergence, and hence 

P(lV fc [t]|<ooVm,A;)>l-& ( 4 ) 
Therefore © holds with probability at least 1 — 5 for v large enough. Finally, for any h mjk [t] such that 

|^m,fc[*]l ^ v > 

{l^m.fcWU^m.fcWr 1 } < 2max{|/i mifc [t]|, 1/im.fcWr 1 }, (5) 



max 



since each finite quantization point is the mid-point of the corresponding bisector interval. 
Since the matrix process 

H[1],H[2],H[3},... 
is stationary and ergodic, the quantized process 

H[1],H[2],H[3},... 



is also stationary and ergodic (see, e.g., 11391 Theorem 6.1.1, Theorem 6.1.3]). Moreover, since each 
h m ,k[t] is circularly symmetric, and since the quantization procedure preserves this circular symmetry, the 
distribution of the quantized channel values h m) k[t] is invariant under multiplication by the Lth root of 
unity. Furthermore, since the K 2 processes (h mik [t]) t( z K are mutually independent as a function of m, k, 
so are the K 2 quantized processes (h m ^[t\)t&M- For future reference, we summarize these observations in 
the following lemma. 

Lemma 3. For each m, k, and t, the quantized channel gain h m>k \t\ and its rotation exp (\/— T^\h m ^\t\ 
have the same distribution. The K 2 quantized processes 

^m,fe[lj; ^-m,fc[2], ^m,fc[3], • • • 

are independent as a function of m, k. The quantized matrix process 

H[1],H[2],H[3],... 

is stationary and ergodic. 

We consider a block of length T of channel gains with T a multiple of L, and we divide this block 
into L subblocks each of length T/L. Count the number of occurrences of a particular channel realization 
H G i-L KxK in one of the L subblocks. By the ergodicity of the quantized matrix process, we expect 
this number to be close to T/L times the probability of this realization. The next lemma formalizes this 
statement. 

Lemma 4. For any L, v e IN and rj, e > 0, there exists T = T(L, u) e IN divisible by L such that, with 
probability at least 1 — e, we have, for all £ G {1, . . . , L}, and all H G l L L KxK , 

IT/L 

t{H[t] = H}>(l-rj)p A (H)T/L. 

t=(e-i)T/L+i 

Proof: By Lemma |3l the quantized matrix process 

H[1},H[2],H[3},... 
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is stationary and ergodic. This stochastic process takes values in the finite set H KxK , and hence, by the 
ergodic theorem (see, e.g., [|39l Theorem 6.2.1]), its empirical distribution converges to the true distribution 
almost surely. For fixed £ G {1, . . . , L}, this implies that there exists a T such that with probability at 
least l-e/L, we have for all H 6 H KxK , 

IT/L 

l{A[t) = H}>(l-r,)p A (H)T/L. 

t=(£-l)T/L+l 

Applying the union bound over £ £ {1, . . . , L} proves the result. ■ 

V. Proof of Lower Bound in Theorem [Qfor Two Users 

In this section, we prove the lower bound in Theorem Q] for the two user case, i.e., K = 2. Consider 
a block of T channel gains, and divide this block into L subblocks each of length of T/L (assumed to 
be an integer). The construction of the achievable scheme in Theorem \T\ consists of three main steps. 
First, we carefully match L time slots, one from each of the L subblocks. This matching is performed 
approximately T/L many times such that essentially all time slots in the block of length T are matched 
(see Section IV-Al) . Second, we argue that any L time slots matched in this fashion, when considered jointly, 
can be transformed into a channel with (nearly) integer channel gains using appropriate linear precoders 
at the transmitters and matched filters at the receivers (see Section IV-BD . Third, we show that over this 
integer channel we can efficiently and reliably compute functions of the messages (see Section IV-CI) . 

A. Matching of Channel Gains 

We start with the matching step. Since the number of possible channel realizations is uncountable, only 
approximate matching is possible. To this end, we quantize each of the channel gains as described in 
Section [IV] Denote by h m ^[t] the quantized version of the channel gain h m ^\t\. By Lemma Ul for every 
E\ > and i] > 0, there exists T large enough such that with probability 1 — e\, each of the L subblocks 
is "typical", in the sense that, for every subblock £ E {1, . . . , L}, and every realization H e 7^ xA ' Q f the 
quantized channel gains, 

IT/L 

J2 l{A[t) = H}>(l-ri)p A (H)T/L. 

t=(£-l)T/L+l 

Recall that full CSI is available at all transmitters and receivers. Hence all transmitters and receivers 
can determine at the end of the block of length T if the realization of quantized channel gains is typical. 
Whenever this is not the case, the decoders declare an error. By the argument in the last paragraph, this 
happens with probability at most ex. We assume in the following discussion that the quantized channel 
gains are typical. 

We can then assume that every matrix of quantized channel gains H appears exactly^ 

(1 - v )p A (H)T/L (6) 

many times in each of the L blocks, ignoring all the remaining time slots. This results in a loss of at most 
a factor (1 — 77) in rate. Furthermore, by the full CSI assumption, all receivers and transmitters can reorder 
the time slots within each of the L subblocks, and hence we may assume without loss of generality that 
the first (1 — rj)T/L quantized channel gains in each subblock satisfy this condition. 



'Since T/L will grow to infinity, we can assume here that ((6} is integer and avoid floor operators. 
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We now describe the matching procedure alluded to earlier. Consider the channel gains at time t\ — 1 
in the first of the L subblocks and the corresponding matrix of quantized channel gains H\ti\. Let tt be 
the first time in block t G {2, ■ ■ • , L} such that 

kM = KM ( 7a ) 

Km = Km (7b) 
Km = Km ( 7 °) 

KM-^KM (7d) 

where 

a* 4 exp (V=T$) 

is the Lth root of unity. By construction of the quantization scheme, if h G "H then co^h G "H, and hence 
such a collection of time slots i 2 , • • • , tr, can exist. Since ti < t 2 < ■ ■ ■ < t L , this matching procedure can 
be performed in a causal manner. Moreover, by the full CSI assumption, this matching can be computed 
at each transmitter and receiver. 

Having performed the matching for t\ = 1, we proceed with t\ = 2. We again match channel gains in 
the same fashion, ensuring that each time slot tt in subblocks i G {2, . . . , L} is chosen at most once. In 
other words, this matching procedure constructs many nonintersecting L-element subsets {ti, . . . ,t L } of 
{1, . . . , T}. We now argue that this procedure can be continued successfully up to t\ = (1 — rj)T/L, i.e., 
(1 — rj)T / L of these subsets can be found. 

Consider a time slot t\ in the first subblock and the corresponding channel gains -ff^i]. This channel 
gain induces matched channel gains 

H[t 2 },H[t 3 },...,H[t L ], 

within subblocks 2, . . . , L. Hence, the distribution of the channel gains H[ti] at some fixed t\ induces a 
distribution of the channel gains H[ti] for I G {2, . . . , L}. It is not clear a priori that H[ti] and H[t] for 
any fixed t have the same distribution. 

The key observation for the analysis of the matching procedure is the following: By ©, the matching 
procedure is successful for all t\ G {1, . . . , (1 — r))T/L} if the distribution of H\tf\ for I G {2 . . . , L} is 
the same as the distribution of H[(£— 1)T/L + 1] (or any other channel matrix at fixed time t in subblock 
£). By stationarity, the distribution of H[(£ — 1)T/L + 1] is the same as the distribution of H[l]. Hence, 
it suffices to argue that H[tg] has the same distribution as H[l], i.e., that H\t(\ has distribution p^. We 
now show that this is the case. 

By assumption, the distribution of each channel gain h m ^[t] is circularly symmetric. By Lemma [3} 
the quantization scheme preserves this circular symmetry, in the sense that all possible quantized channel 
gains with the same magnitude have the same probability. Since the components of H[tj are independent 
by Lemma [31 this circular symmetry also holds for their joint distribution, i.e., if H and H' satisfy 
|hm,fc| = \h' mk \ for all m,k, then 

pa(hO=pa(h). 

Observe now that, for each m, k, the channel gains 

hm t k[tl], h m> k[t2\, ■ ■ ■ , h mt k[tL\, 

all have the same magnitude by the matching condition ©. Moreover, since the distribution of H\t\\ is 
circularly symmetric, and since © results in a fixed phase shift, the induced distribution of the matched 
channel gains H[tg) is circularly symmetric as well. Together, these two facts shows that the distribution 
of the quantized channel gains induced by the matching within the subblocks i G {2, . . . , L} is identical 
to the distribution of the quantized channel gains within the first subblock. This implies that the time slots 
t\ = 1 up to t\ — (1 — rj)T/L can be matched up by the described procedure. 
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Out of the (1 — i])T/L time slots that are matched in this fashion, at most 5(1 — rj)T/L contain a 
quantized channel gain equal to infinity by © for some 5 = 8(u) (where v is the parameter governing 
the number of quantization points). These time slots are not used. Again by the full CSI assumption, this 
event can be observed at each transmitter and receiver. Accounting for the time slots that are not matched, 
a total of at least (1 — 77) ( 1 — 8)T/L time slots in each subblock are used for communication. 

To summarize, the channel gains in each of the L subblocks are matched up to satisfy C7]). With 
probability at least 

l-ei(T), (8) 

at least a fraction 

(l-r](T))(l-6(v)) 

of the time slots in each subblock can be matched in this fashion such that all the corresponding channel 
gains have finite magnitudes. Here the parameters can be chosen to satisfy 

Iimei(T) = 0, (9) 

T— >oo 

lim T](T) = 0, (10) 



T-t-oo 



both for fixed values of L and v, and 



lim <J(i/) = 0. (11) 



B. Precoding and Matched Filtering 

Consider time slots t\, . . . , in subblocks 1, . . . , L that are matched as described in the last section. 
We now describe a linear precoding transmitter design and matched filtering receiver design that transform 
the complex channel over these L time slots into an integer channel. 

Construct the diagonal matrix 

D m>k = diag ((h m ,k[t<e\)f=i) , 

from the L matched channel gains between transmitter k and receiver m and define -D m ,fc in the same 
manner, but with respect to h m ^[ti\. Observe from © that 

^ fV* 1 if (m,fc)^(2,l), 
m ' fc \h m>h F if (m, k) = (2,1) 

by the matching procedure, where 

hm,k — ^-m,fc[^l]; 

and 

F4 d iag((4- 1 )i 1 ). 

Each transmitter uses a linear precoder over the block of L symbols. In other words, each message 
symbol is multiplied by a transmit vector in <C L . The modulated transmit vectors are summed up by the 
transmitter. At time ti, the component t of this sum of vectors is sent over the channel. 

Transmitter one uses a total of L transmit vectors. The first transmit vector is 

where 1 denotes the vector of all ones. The remaining transmit vectors for t 6 {2, . . . , L} are obtained 
recursively as 

a , rS rS u ^2,2^1,1 „-\ 

v hl = b e D 2 1 D 2t2 D 1 2 A,i«m-i = be ? — ; — F 

h 2 ,ih 1>2 
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where the scalar bi is of the form n or 1/n for the smallest natural number n E IN such that 

Wv^W/VZe [1,2). 



(12) 



For convenience of notation, we set b\ = 1. The scalar &£ can be interpreted as a normalization constant; 
it ensures that all transmit vectors have approximately the same norm. The reason that bi has to be a 
small integer or its inverse will become clear later. 

Transmitter two uses a total of L — 1 transmit vectors, which are defined through the transmit vectors 
for transmitter one. Transmit vector v 2} e for £ E {1, . . . , L — 1} is 



Define 



It follows from (O that 



and that 



c = c(H 



v 2j £ = D^Di^Vi 

n 



hi >2 

max ||/?, m) fc|, |/i m fc| j. 



m,k 



1/c < M < ll^ll/VZ < 2iM < 2c 



(13) 

(14) 
(15) 



max{6^, 6^ x } < 2c. 
Observe that every transmit vector Vk,i is of the form 

v kti = P F a l 

for some scalars p E C and a£l. Since 

1 T F Q 1 = l {aeLZ} , (16) 

this ensures that two different transmit vectors are either collinear or orthogonal. 

Each transmitter modulates independent message symbols over its transmit vectors. We will provide 
a detailed description as to how these symbols constitute codewords across matchings of time slots in 
Section IV-CI Transmitter one has access to L symbols s 1 ) i, . . . , s^l and transmitter two has access to 
L — l symbols S2,i, • • • , S2,l-i- We assume that all these 2L — 1 symbols have zero mean and are mutually 
independent. 

The vector of channel inputs 

x k = (x k [t 1 ] x k [t 2 ] ■■■ x k [t L ]) T 
at time slots t 1( . . . , t L at transmitter k E {1,2} is given by 

L 



and 



Xo 



L-l 



We allocate the same amount of power 



P 
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to each s^. Since 11^/11 2 < 4Lc 2 by (fT2l) and (fl4l) . we have 

^EdlajfcH 2 ) < P, 

satisfying the overall average power constraint of P over the L time slots ti, . . . , iz,. Since transmitter 
one has L transmit vectors and transmitter two has L — 1 transmit vectors, the transmitters send a total 
of 2L — 1 data streams over L channel uses. 
The vector 

Vm-(Vm[ti] y m [h] ■■■ y m [ti]) 
of channel outputs at time slots ti, . . . , tt at receiver m is then 

Um = D m> iX\ + D m ^ 2 X 2 + Z m 

£ L-l 

= si^£> m) iu lj£ + ^ s 2> iD m>2 V2,e + z m , (18) 

where 

Z m = ( z m[tl\ Zmfa] ■'' ^mlXd) 

is the additive noise at receiver m. 

Each receiver m e {1, 2} uses matched filters on the received channel outputs y m , which we describe 
next. Assume for the moment that the channel gains D m ,k are equal to their quantized version D m ^- 
Consider two transmit vectors Vij and v 2> t. At receiver one, these transmit vectors are observed as 

D^v^t = h^ivxj, (19a) 

and 

h 

Di,2V 2 ,t = Di^^-v^i = h ltl v 1;i . (19b) 
hi,2 

Hence the two transmit vectors Vi/ and v 2 ,t are aligned at receiver one. Similarly, the two transmit vectors 
Vi^+i and v 2 / are observed at receiver two as 

D 2>1 v 1>t+1 = b i+1 D 2>2 D^D hl v lti = b i+1 h 2t2 v 2/ , (20a) 

and 

D 2)2 v 2ji = h 2)2 v 2> e. (20b) 

Again, the two transmit vectors and v 2 ^ are aligned at receiver two. Moreover, all other transmit 

vectors are received in orthogonal directions at both receivers by (fT6l ), since they have a different power 
of the diagonal "Fourier" matrix F. Thus each receiver observes a linear combination of L orthogonal 
vectors with L — 1 of these vectors carrying an aligned message symbol from both transmitter one and 
transmitter two. This situation is illustrated for L = 2 in Fig. [3] in Section ITlI-Al 

Thus, if D m> h = D m ,k for all m, k G {1, 2}, then each receiver can project the received signal onto the 
L rotated transmit vectors. For one of these vectors, there is just one transmitted signal. For the remaining 
L—\ observed vectors, the receiver must decode the sum of the two aligned data streams. Formally, this 
operation of the receivers is implemented by multiplying the vector of channel outputs y m by the matched 
filter 

for m = 1, j 6 {1, . . . , L} and for m = 2,j e {1, . . . , L — 1}, to form 
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In general, the channel gains are not equal to their quantized versions, i.e., D m>k ^ D m ,k- However, 
since we only communicate during time slots satisfying |/i m fc[t]| < v, the quantization error is upper 
bounded by © as 

\h m ,k[te\ - h mtk [U}\ < (tt + 1)/u, 

so the matrices D m ^ and D m ,k are quite close for quantization parameter v large enough. We will use 
the same receiver structure as for the perfectly matched case, i.e., we use the matched filters according 
to the quantized matrices. To determine performance, we will bound the additional interference that is 
caused by imperfect alignment (received vectors do not line up) and imperfect zero forcing of interference 
(received vectors are not orthogonal). 

Using (TT8T ), the output of the jth matched filter v m j at receiver m is then 

L L-1 

t=\ e=i 

Defining 

T 1 1 1 1 

m, k J - y m,k J -'m,k 



as the (diagonal) matrix of channel quantization errors, we can rewrite the output of the matched filter at 
receiver one 

,~,t ... _ c ._ .„~,t n .„,.., „. .»~,t 



+ v\ jZl (21a) 
for j G {1, . . . , L — 1} and 

v\ L yi = s 1:L v\ L D 1A v 1)L 

+ s ltL v\ L T 1A v ltL + J2e=i s i,M,L D ^ v hi + Y^i s 2jv\,L D ia v *A 
+ v{ L Zi (21b) 
for j = L. Similarly, we can rewrite the output of the matched filter at receiver two as 

t>5j!/2 = S ltj+ ivl d D 2>1 V 1>j+1 + S2 :j vijD 2 ,2V2,j 

+ s ltj+1 vljT 2 ,iv ltj+1 + s 2 ,jvl ii T 2 ,2'U2,j + 52eft+i s i,eV2j D 2A v W + J2eft s 2,evl ;j D 2j2 v 2j e 
+ v\jZ2 (21c) 

for j G {1, . . . , L — 1}. From (|2TT) . we see that the matched filter output consists of three parts: desired 
signal, mismatch terms due to imperfect alignment and imperfect zero forcing of interference, and receiver 
noise. 

We start with the analysis of the desired signals in (|2TI) . The desired signal at receiver one is 

sxjvljD^v^j + s 2j jv\ j b 1 ^v 2 ,j = + s 2j) (22a) 

for j G {1, . . . , L — 1} and 

s ltL v\ L D ltl v ltL = h 1A \\v 1:L \\s 1:L (22b) 
for j = L, where we have used (fT9l ). Similarly, the desired signal at receiver two is 

si )i+1 v| .Z> 2 ,ifij + i + s 2 ,jvl jD 2 ,2V2,j = h2,2\\v2,j\\(b j+1 si J+1 + s 2 j) (22c) 
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for j e {1, . . . , L - 1}, where we have used (f20l). Using (fi~2~l). (fT3l) , (fl4l) . and CH]), the received signal 
power (for each symbol) satisfies 

LP 

|^i,i| 2 ||v 1;i || 2 ]E(ls fcjJ | 2 ) > \hi tl \ 2 LP > — . (23a) 



at receiver one and 



I?) I 2 I/! I 2 T P 

|/> 2 , 2 |1« 2J || 2 E(M 2 ) > LP > ^- (23b) 



at receiver two (not accounting for the normalization factor bj + %). 

Before we continue with the analysis of the mismatch terms in (|2TI) . we argue that jv^ jT m> kVk,e\ 2 * s 
small. By the Cauchy-Schwarz inequality, 

\vl hj 'Y m ,kVkj\ 2 < \\v m ,j\\ 2 \\Y m ,k\\ 2 \\vk,£\\ 2 , (24) 

where ||T. m fc || 2 denotes the sum of squared diagonal entries of T mfc . By construction, ||vfcj|| 2 = 1. From 
®, ||T mfc || 2 satisfies 

||T m> ,|| 2 <L(7r + l) 2 /z/ 2 . 

By flU) and CE]), 

\\v k j\\ 2 < 4Lc 2 

for k G {1, 2}, where we have used that c > 1 by (PTSl . Combining this with (1241) yields the desired upper 
bound 

t |2 ^ 4L 2 ( 7 r + l) 2 c 2 A 2 
DmiT^fly < 5 =7 • (25) 

The mismatch term in (I2T1) due to imperfect alignment is 

sijvljT^Vij + s 2d vl j r lt2 v 2tj = e^ijsij + e 1)2 j8 2 j (26a) 

at receiver one, and 

s ltj+1 vljT 2tl v 1J+1 + s 2 jvi d Y 2t2 v 2J = e 2j i tj si J+1 + e 2Xj s 2 j (26b) 

at receiver two. Each term e mt kj can be interpreted as the residual channel fluctuation after the quantized 
matching, and satisfies 

|e m ,fcj| 2 < 7 2 (27) 



by (|25]>. 

The mismatch term in (I2TT) due to imperfect zero forcing is 

Oi,j = s i,M,j D i,i v i,t + ^2 s 2t evljD 1<2 v 2/ 

= 22 si/vljY^iV^e + 22 S2,ev\jY lt2 v 2/ (28a) 

at receiver one, where we have used the orthogonality of the received vectors under channel gains D m ,k- 
Similarly, 

d 2 j = ^2 s i,tvl jD 2jl v 2) i + ^2 s 2,£vljD 2 , 2 V 2/ 

= "22 s i,^h^2,iV 2/ + ^2s 2j£ vl d T 2}2 v 2/ (28b) 

eft+i eft 
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at receiver two. Using (TT7T ) and (|25l) together with the independence of the signals s^, the total zero- 
forcing leakage power a 2 is upper bounded by 

a 2 = maxE(|^ mii | 2 ) < 2(L - 1) 7 2 P (29) 



Zm.J i^rn (30) 



= St ||.. || , ■ r ( 31 ) 



at each receiver. 

Finally, the additive noise term 

in (12TI) is circularly-symmetric complex Gaussian with mean zero and variance one, since ||'S m j|| 2 = 1. 

Substituting (|22l> . (Il6l) . (|28T) . and ([30]) into (|2lT) . yields that the output of the jth matched filter at receiver 
one is 

+ S 2,j) + j ^ L, 

^hl,l\\Vl,L\\Sl,L + j = L 

where 

n A I e l,l,j S lj + e l,2j S 2,j + #l,j + ZlJ j 7^ A 

is the sum of the imperfect alignment, imperfect zero forcing, and noise termsH Using (PT71) . (|23~T) . (|271) . 
(1291 ), and (|30l) . the signal-to-interference-and-noise ratio (SINR) for each subchannel at receiver one is 
lower bounded by 

LP/c 2 



SINR! > 



> 



1 + a 2 + 2 7 2 P 

LP/c 2 
1 + 2L 7 2 P 

P/(4c 4 ) 



1 + 2L 2 (vr + l) 2 c 2 P/(z/ 2 c 2 ) 

P/(4c 4 ) 
l + 2L 2 (vr + l) 2 P/^ 2 ' 



(33) 



Similarly, at receiver two, we have 



v 2 i2/2 = /l2,2 1| «2J |] + S 2 j) + fl2,j (34) 



with 

^2,j = 62,1^5!^+! + e2 j 2,jS2,j + $2,j + ^2j. (35) 

Recall that is of the form n or 1/n for some natural number nGli with n < 2c by (fl~5T) . If bj + i = n, 
then both channels have integer coefficients. If bj +1 = 1/n, then we can multiply the channel output by 
n to obtain a channel with integer coefficients. This decreases the effective SINR by at most a factor 4c 2 . 
Following the same steps as before, the signal-to-interference-and-noise ratio is lower bounded by 

SINR * a iTW^W - (36) 

As we had seen earlier, the 6j factor serves as a normalizing term to ensure that all the transmit vectors 
Vk : i have approximately magnitude y/L. From (1341) . it is now clear why bj has to be chosen as a small 
integer or its inverse. Indeed, it is precisely this property that ensures that the subchannels induced by the 
matching of channel gains and the precoder / matched filter have essentially integer channel gains. As we 
will see, having integer channel gains significantly simplifies the task of efficient reliable computation. 
This transformation of the original channel with real coefficients into subchannels with integer coefficients 
is at the heart of the proposed communication scheme. 



2 The noise term fiij depends on the signal Sk,e and is, therefore, not additive. We will handle this difficulty later. 



20 



C. Computation of Functions 

In the last section, we constructed and analyzed the channel induced by the linear precoder and matched 
filter. We now show how to reliably compute functions over this channel from the precoder input to the 
matched filter output. 

Consider all time slots in the first subblock with quantized channel realization H G 1 L L KxK . By LemmaHl 
with probability at least 1 — £\{T) there are at least 

T< A > a (1 _ n(T))p A (H)T/L (37) 

time slots in the first subblock that have this quantized channel realization. By the matching construction 
in Section IV-A1 the first such time slots can be successfully matched with time slots in subblocks 
£ £ {2, . . . , L} with quantized channel realizations chosen according to ©. 

By (I3TT) and (1341) . the precoding and matched filtering scheme from Section IV-B[ transform each group 
of L time slots into L — 1 subchannels of the form 

rt] M = $ 0$ [A + 4? W) + A$ M (38a) 
r<3>[t] = $\a§\ l8 %[t] + a$8$}[t]) + $[t] (38b) 

for j G {1, . . . , L — 1}, and where sj^ are the channel inputs, a^ +1 and are nonzero integers, j3^] 

are positive scaling factors, and //^[t] an d A*2^W ^ interference and noise as in (|32l and (I33T ). Receiver 
one observes one additional subchannel of the form 

rSW=<M2M+M2M- (38c) 
From (|33l and (|36l ), the SINR to all of these subchannels is lower bounded by 

SINR(H) = minSINR m (H) 

m 

P/(16c 6 (H)) 

- 1 + 2L 2 (tt+1) 2 P/z/ 2 ' 1 ; 

where we have explicitly written out the dependence of c and SINR on H. 

Each transmitter k splits its message Wk into non-overlapping submessages w^j, one for each subchannel 
j of quantized channel realization H. Each such submessage is a vector with components in {0,1, ...,(? — 
1}. Receiver one attempts to recover the functions 



u 



(H) a f w hj + "'w ( mod i) if j e {1, . . . , L - 1}, 
[it;^ otherwise, 



over subchannel j G {1, . . . , L}. Receiver two attempts to recover the functions 

(H) A (H) (H) (H) (H) / i \ 

U 2,j = a lJ+l W lJ+l + a 2,j W 2j ( mod <l) 

over subchannel j G {1, . . . , L — 1}. 

(H) 

These equations are clearly invertible. Indeed, receiver one decodes w\ £ alone. Receiver two computes 

(H) (H) (H) 

a linear combination with nonzero coefficients of w 2 [_ x and w\ £. Knowing w\ £, we can thus recover 

(H) 

w 2 £_ x . Continuing in the same manner, alternating between the receivers in each step, we can successively 
recover all transmitted messages. This shows that the mapping between the messages at the transmitters 
and the decoded functions at the receivers is invertible. 
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Fix a quantized channel realization H. Applying L times^ Lemma [5] in Appendix lAl guarantees that 
over the subchannel (1381 ), a computation sum rate (normalized by the number of time slots in the 
subchannel) arbitrarily close to 

(2L-l)log(SINR(H)) 

is achievable with average probability of error at most e 2 H \t^) — > as — > oo. In terms of the 
original blocklength T, this translates to a computation sum rate of 

(2L-l)^-log (SINR(H)). 

Moreover, since T( H ) oo as T oo, and since, for fixed quantization parameter v there are only 
finitely many values of H, we also have 

e 2 (T) =max4 A) (r (A) ) 

H 

as T — > oo. 

We repeat the coding procedure above for all quantized channel realizations H with finite magnitudes, 
i.e., satisfying ||H||oo < oo. The overall computation sum rate is at least 

(2L-1) ^-log(SlNR(H)) 

H:||H|| 00 <oo 

>^^(1-^)) J2 P A (H)log(SINR(H)) 

H:||H||oo<oo 
H:||H||oo<oo 

> - V(T)) ((1 - *(")) log ( 1 + 2L2 ^' 6 1)2p/i/2 ) " 6E( log(c(H)); ||H|U < oo)) , 

where (a) follows from (1371) . (6) follows from (l39l) . and (c) follows from ©. Here, the (1 —i](T)) factor 
accounts for the loss in matching the channel gains at times ti, . . . , t L , and the factor (1 — 5(i / )) accounts 
for channel realizations that are quantized to oo, see Section IV-AI Both r/(T) — > as the blocklength 
T — > oo by (flOl) and 5{u) — > as the quantization parameter v — > oo by (fTTT) . 

There are two sources of error in this communication scheme: atypicality of the channel gains and 
atypicality of the noise terms. The channel gains are handled by the matching construction described in 
Section IV-AI We declare an error whenever the channel gains are atypical, which happens with probability 
at most Ei(T) with ei(T) -)■ as T -)■ oo for fixed L and v by ® and ©. The noise is handled by the 
computation code over the integer channel. As we have seen above, an error occurs with probability at 
most £ 2 (T) with e 2 (T) — > as T — > oo for fixed L and z/. Since the number of finite quantized channel 
gains is at most u 4 L by ©, and since the number of receivers is 2L — 1 < 2L for each such realization 
of the quantized channel, with probability at least 

l-e l {T)-2v A L 2 e 2 {T) 

all decoders are successful. For a fixed number of subblocks L and fixed quantization parameter z/, this 
quantity converges to one as T — > oo, yielding an achievable computation sum rate of 

mL,u) 4 ^((1 -6{u)) log ( 1 + 2L2 P { i^ 1)2p/u ) -6E(log(c(H)); ||ff|U < oo)). 



3 Since the input symbols at the two receivers for different values of j £ {1, . . . , L} are coupled, we need to make use of the universality 
of the channel encoders mentioned after the statement of Lemma [5] 
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Hence the computation capacity C(P) is lower bounded as 

C(P) > R(P,L,u). 

Since this is true for all values of u, we may take the limit as v — > oo to obtain 

C(P) > lim R(P, L, v) 



U—¥oo 

2L - 1 



log(P/16) - 6 lim E(log(c(JT)); \\H\\oo < oo) 



L 

In Appendix El we show that 

lim E(log(c(H)); \\H\\oo < oo) < 3. 

v— >oo 

Thus, the computation capacity is lower bounded by 

C(P) > lim R(P, L, v) 
2L — 1 

= — — (log(P/16)-18). 
Finally, we may take a limit as L — > oo, yielding a computation rate of 

C{P) > lim lim R(P, L, v) 

L— ¥oo u—>oo 

= 2 log(P) - 44 
> K\og(P) - 7K 3 , 

concluding the proof of the lower bound in Theorem \T\ for K = 2. ■ 

VI. Proof of Lower Bound in Theorem Q]for K > 2 Users 

As in the two-user case in Section |Vj the proof for K > 2 proceeds in three steps: matching of channel 
gains (see Section IVI-AI) . linear precoding and matched filtering (see Section IVI-BI) . and computation of 
functions of the messages over the resulting channel from the precoder input to the matched filter output 
(see Section IVI-CI) . We again quantize all channel gains as described in Section [IV] and consider large 
blocklengths T such that this quantization can be performed for arbitrarily large quantization parameter v 
and such that the resulting observed sequence of quantized channel gains is r/-typical with high probability. 
Since the effects of quantization and atypicality are essentially identical to the two-user case, we will not 
repeat this analysis here and instead assume directly that v oo, which implies that h m> k\t\ ~ h m; k[t]. 
The quantization and typicality arguments for K = 2 carry over for K > 2. 

A. Matching of Channel Gains 

Fix a large blocklength T and a natural number /. Define 

L±(I + lf\ 

and divide the block of T channel realizations into L subblocks of length T/L (assumed to be integer). 
Consider the channel gains at time ti — 1 in the first of these blocks and the corresponding channel gains 
H[ti\. Let t£ be the first time in block £ such tha^l 

h m) k[te) = ^ m ' h h m ,k[ti\ 

4 The probability of this event happening is, of course, zero. The statement is to be understood in terms of the quantized channel gains 
h m ,k\i\ and sufficiently large v so that h m ^\t] ~ h m ,k[i\- 
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for all k, m G {1, . . . , K}, where uj l is the Lth root of unity as before, and where 

rf m)fc 4(/ + i)(*-D^- 1. 

Repeat this construction with t\ = 2 and so on, ensuring that no time slot is matched more than once. 

By the assumptions of circular symmetry and ergodicity of the fading gains, essentially all but a fraction 
o(l) of the channel gains can be matched up in this fashion as T — > oo (see Lemmas [3] and H]), and we 
will assume in the following that T is large enough to ignore the o(l) term (see Section IV-AI for a detailed 
analysis). 



B. Precoding and Matched Filtering 

Consider now one such sequence of matched time slots ti, . . . As in the two-user case, we use 
linear precoders and matched filters over the vector channel induced by these L time slots. Define the 
diagonal matrix 

D m>k = diag ((h m , k [te))f =l ) 

corresponding to the vector channel of length L between transmitter k and receiver m at time slots 
ti, . . . ,tL- By construction, 

I^m,k h mk F j 

where 

hm,k ^m,fc[^l] 

and 

F^diag^i" 1 )^). 

Each transmitter uses again a linear precoder with transmit vectors v G V C C L . The set V is constructed 

as 

V = \ v = (il ( if t\)Dl:Al : a m , k G {0,...,/- 1}}. 

^ m,k a=l ' * 

Since all channel matrices D m ,k are diagonal by construction, the product D^D^ k commutes, and 

hence it is immaterial in which order the product in the definition of V is taken. The scalars are 

constructed recursively, starting from b^ k . Each is of the form n or 1/n for the smallest natural 
number n G IN such that 

^m, k 



a=l 



As in the two-user case, the role of the is to ensure that the transmit vectors all have approximately 
the same norm. In particular, 

HI < 2 K2 VZ (40) 
for every v G V. Moreover, by the recursive construction, 

(2max {\h m>k \, {hm^ 1 }) 1 < min {b^\, l/b^' k } < max {b^\, l/b^ k } < 2max {|/i m)fc |, |/i mifc | -1 }, 

(41) 

and hence 

(2 k2 c)- 1 < u^{tr\vtr ] } < ri max {& fc) ,i/fe fc) } < ^ K2 c m 

m,k m,k 
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for all a mtk E {0, — 1}, and where 

c = c(H) = JJmax{|/i m) jfc|, l/im,^ -1 }- (43) 

Observe that, as in the two-user case, each transmit vector v G V is of the form 

v = pF a l 

for some scalars p E C and a G IN. By (PT6l) . this implies that any two transmit vectors in V are either 
collinear or orthogonal. As we will see next, all vectors in V are, in fact, orthogonal. 

Each v E V is a complex- valued vector of length L defined by a monomial up to power I — 1 in the 
channel matrices D nhk . By definition, every collection of powers a mjk E {0, . . . , I— 1}, m,kE {1, . . . , -R'} 
corresponds to an element uG V. We now argue that this correspondence is one-to-one. implying that 

|V| = i K \ 

Moreover, together with the argument in the last paragraph, this will also ensure that all vectors in V are 
orthogonal. 

To this end, consider v E V and write it as 



m,k 



for some a mjk E {0, . . . , 1 — 1} and scalar p. The first component of v is equal to p. The second component 
of v is equal to pul with 



A 

a = 



^ oi m ^ k d m ^ k (mod L) 

m,k 

Y J ^ m ,k{I + ^) {k ~ 1)K+m ~ l ML). 



Since each a m fc G {0, — 1}, this last sum is less than L, and so the modulo L operation can 
be dropped. Thus, the coefficients a m fc of a can be determined uniquely by computing the (/ + l)-ary 
expansion of a. Moreover, knowing p from the first component of v, a can be uniquely determined from 
the second component of v. Together, this shows that there is a unique collection of powers a m ,k G 
{0, . . . , I — 1} for all m, k E {1, . . . , K} that generates v. We refer to this as the unique factorization 
property of V. Since each exponent a corresponds to a unique v E V, this also shows the orthogonality 
of the vectors in V. 

Each transmitter modulates I K zero mean and mutually independent message symbols over its transmit 
vectors. Let s k ^ v be the message symbol at transmitter k sent along transmit vector v E V. The channel 
input 

x k = (x k [tx\ x k [t 2 ) ■ ■ ■ x k [t L }) 



at transmitter k has then the form 



We allocate the same power 



Xk = ^ S k,vV. 

vev 



HM 2 )=^r l = P (44) 
to each s k , v - Since each transmit vector v has squared norm at most 4 r2 L by (|40l) . we have 

^E(W| 2 ) < M x ^ x A* 2 L < P, 



25 



satisfying the average power constraint over the L time slots ti, . . . , t^. Since each of the K transmitters 
has I K transmit vectors, we transmit a total of KI K independent data streams over L — (I + 1) K 
channel uses. 

The corresponding vector of channel outputs 



Vm 



at receiver m is then 



where 



(Vm[h] Vmih] ■■■ y m [tL\) T 
K 

Vm ^ ^ -D mk X k + Z m 

k=l 
K 

= ^2 ^2 s k,vD m ,kV + z m , (45) 

k=l v£V 

[t 2 ] ■■■ z m [t L }) 



is the additive noise at receiver m. 

From (|45l) , transmit vector v G V is observed at receiver m as D m ^v. Each receiver m the receive 
vectors 

V m = {v = D m , k v/\\D m , h v\\ : k G {1, . . . , K}, v G V} C C L 
as matched filters, computing v^y m for each v e V m . The number of matched filters is at most 

\Vm\ < (I + lf 2 - 

By the same argument as for V, it can be shown that V m also has the unique factorization property. 
In other words, to every v G V m corresponds a unique collection of powers ct m>fc G {0, . . . , J} for all 
m, k G {1, . . . , K} such that 

^ _ \ ( I I _p 1 "m,fc<im,fc N \ I 

m,fe 

As for V, this implies that the vectors in V m are orthogonal by the properties of the "Fourier" matrix F. 

The equivalent channel, consisting of the linear precoder, the wireless channel, and the matched filters, 
has I K channel inputs at each transmitter and at most (I + \) K channel outputs at each receiver. Since 
the matched filters are normalized to have unit norm, each such subchannel at the receiver is an additive 
Gaussian noise channel with unit noise power. We now argue that we have again signal alignment as in 
the two-user case. 

As pointed out above, the transmit vector v G V at transmitter k is observed at receiver m as D m ^v. 
By construction of the set of matched filter vectors V m at receiver m, D myk v is a scalar multiple of a 
vector v G V m . Since all the vectors in V m are orthogonal, this implies that the matched filtering operation 
v^y m removes all but those transmit signals which are aligned with v. 

We now analyze the magnitudes of the signals that are observed along one receive vector v G V m at 
receiver m. By unique factorization, there exists a unique collection of exponents ^ G {0, ...,/} such 
that 

m,k 

for some scalar p. Assume a signal modulated over transmit vector v k at transmitter k is observed along 
vector v at receiver m. Note that this is only possible if a myk G {1, . . . , /} and a~ n ~ k G {0, 1} for 
all (m, k) 7^ (m, k). The transmit vector v k has then to be equal to 



•.=(n«Kr n (n»&K?>- 

a=1 (fh,k)^(m,k) a=1 
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Defining 

Ylrhl Yla=l k 



A llm,* 110=1 h 

b = 



- 1 ±K m.K 



and 

,(<* r) 



fc±nCr. <46) 

this allows to write in terms of v as 

Since the collection of exponents £ corresponding to v is unique, and by orthogonality of V, this 
implies that there are at most K signals that are aligned along the same vector v at receiver m, and they 
are all observed with the same common channel gain times a factor b k depending on the transmitter k. 
Using the orthogonality of the matched filters and (l47l) . the output of the matched filter applied to the 
channel output (|43T ) can then be written as 

K 
K 

^ Sk,v k ^ D m ^ k V k -\- Z m fi 
k=l 

b K 

= - bkSk,v k + z m,v, (48) 
P k=l 



where 



A ~t 



is additive circularly-symmetric complex Gaussian noise with mean zero and variance one, and where 
v k = v k{v,m) depends on both the matched filter v and the receiver m. We can interpret (|48l) as a 
subchannel between the inputs to the precoder v k at each transmitter k and the output of matched filter 
v at receiver m. 

We point out that, similar to the two-user case, not all K transmitters contribute to all matched filter 
outputs v^y m . Indeed, if a m ^ = in the unique factorization of v at receiver m, then there is no 
corresponding transmit vector v k at transmitter k. For ease of notation, we assume that s k iV = in this 
case, so that (l48l is still valid. 

We now bound the channel gains in the matched filter output (1481) . From (|47T) . we have 

IL/ I \\ V k\\ \h m ,k \\\ V k\\ 

b k\\D m>k v\\ b k 

By (@B, gl, and ©, 

\h m ,k\ l bk — \h m ,k\ 1 Y\.^ > m"k' k —2 K C. 

kjtk 

Together with (1401) . this shows that 

Wp\ > ^ (49) 
Moreover, each b k is is a product of at most K scalars, each being either a natural number or its inverse. 
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We want to multiply the output of the subchannel (1481 by a positive scalar p such that pb k E IN for all 
k. By the definition of b k in (|46l) , we can choose 

P = fl.nax {1,1/Cf'}. 

k=l 

Using dH]) and (@2l), we thus have 

P < 2 K c, (50) 

resulting in a decrease of effective signal power by at most a factor A K c 2 . 

To summarize, the channel (1481) between the input to the matched filter at transmitter k and the 
output of the matched filter v 6 V at receiver m is of the form 

K 

^m,v $m,v ^ ^k^kjV^ pm,vi (51) 
k=l 

for nonzero integer channel gains a k , scaled Gaussian noise p m ,%, and positive scaling factor (3 m ^. Using 
(144]) . (l49l) . and (|50l) . the signal-to-interference-and-noise ratio SINR of this subchannel is lower bounded 
by 

P\b/ P \ 2 



SINR > 



P 2 



P/(A K2 L) x L/(A K c 2 ) 
P 

C. Computation of Functions 

We use a computation code over the channel from the precoder input to the matched filter output 
constructed in the last section. This will allow us to reliably decode functions of the transmitted messages 
over this channel. 

As in the proof of the two-user case, we code over several channel uses, each with the same channel 
realization H. For each such H, we are hence dealing with a channel that is constant across time. 
Each transmitter k splits its message w k into non-overlapping submessages, one for each subchannel 
(I5TI) between precoder input and matched filter output, and for each channel realization H. Each such 
submessage is again a vector over {0, . . . , q — 1} for some q. The decoder aims to compute a modulo-g 
integer linear equation of these messages with coefficients a k as appearing in (|5TI) . 

Using the unique factorization property of V and the fact that all coefficients a k are nonzero, it follows 
from [4, Lemma 8] that the functions to be decoded by the receivers can be inverted. Hence, knowledge 
of all correctly decoded functions at the receivers allows recovery of all the messages. 

Applying L timesLl Lemma [5] in Appendix IA1 shows then that each of the receivers can reliably compute 
its desired functions over the channel given by (I5TI ) at a sum rate at least 

KI r2 log (SINR(H)) > log(P) -AK- IK 2 - 41og(c(H)) 

for a particular realization H of the channel gains, and where we have used (1521) , that the number of 
transmitted messages is |V| = I K , and that there are K receivers. Normalizing by the number (/ + 1)^ 
of channel uses, we can hence achieve a sum rate of at least 

KI R2 / 

m I] ~ (j + iw 2 ( log ( p ) " AK ~ 2R2 ~ 4E ( lo s( c ( 



5 As in the two-user case, the input symbols at the K receivers are coupled. We make again use of the universality of the channel encoders 
mentioned after the statement of Lemma [5] 
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when averaged over all channel realizations. 

The computation sum capacity is then lower bounded as 

C(P)>R(P,I). 

Since this holds for all values of /, and since the constant c does not depend on /, we may take the limit 
as / — > oo to obtain a computation rate of at least 

C(P) > lim R(P, I) 

I— s-oo 

= Klog(P) - 4K 2 - 2K 3 - 4KE(log(c{H))) 
> K\og(P) - 7K 3 , 

where we have used the upper bound on the expected value of log(c(i?)) in Appendix E This concludes 
the proof of the lower bound in Theorem Q] for arbitrary K > 2. ■ 

VII. Proof of Upper Bound in Theorem Q] 
The proof adapts an argument from HOl Theorem 4]. 

Since the receivers compute an invertible function of the messages, the cut-set bound I14TI Theorem 
14.10.1] applies, showing that 

C(P) < sup E(log det(J + HQ(H)H^)), 

Q{H) 

where the maximization is over all positive semidefinite matrices Q(H) such that 

E(tr(Q(if))) < KP. 
Using Hadamard's inequality, this can be upper bounded as 

K 

sup E(logdet(/ + //Q(/f)/f t )) < Y] sup E(log(l + h m Q(H)hl)) 

Q(H) m=1 Q(H) 

< Ksup E(log(l + rP(r))), 

P(r) 

where h m denotes the mth row of H, where 

A lit ||2 
r = \\ h l\\ , 

and where the last maximization is over all nonnegative P{r) satisfying 

E(P(r)) < KP. 

This upper bound on C(P) is maximized by water-filling [|42l . yielding 

C(P) < KE( log(l + rP*(r))) 

with 

P*(r) = (--- 
V/i r 

and [i such that 

E(P*(r)) = KP. (53) 

Since 

P*(r) < -, 
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we can further upper bound 

C(P) < #E(log(l + r//i)) 

< A"log(l + E(r)//i), (54) 

where we have used Jensen's inequality. 

It remains to lower bound /i. By (|53l ), we have 



KP = E(P*(r)) 



=2/x 

> ^P(r > 2 ri . 

The random variable r has Erlang distribution with parameter K and rate one, and hence 

KP > — P(r > 2/i) 

>i-exp(-2„). 
If [i < 1/AKP, then we obtain the contradiction 



KP> i-exp(-2/i) 



> 2irPexp(-l/2is:P) 

> isTP 



for if > 2, P > 1. Hence /i > 1/4KP. 
Substituting this into (|54l) yields 



C(P) < iflog(l + 4XPE(r)) 
= ATlog(l + 4fT 2 P) 
< K\og(P) + 5K\og(K), 

where we have used P > 1 and K > 2. This concludes the proof of the upper bound in Theorem [T] ■ 

VIII. Proof of Theorem [2] 

This section provides the proof for the approximation result of the sum capacity (P) of the P-layer 
relay network. The proof builds on the approximation result for the computation sum rate in Theorem \T\ 
Since the upper bound in Theorem [2] follows directly from the same cut-set bound argument as Theorem [H 
we focus here on the lower bound. 

Each of the D network layers operates using compute-and-forward. We use the same codebook rate 
Rk = R at each source node k E {1, . . . , K}. Using Theorem [TJ the relay nodes at layer one can then 
reliably decode a deterministic invertible function of the messages at sum rate at least 



K log(P) - 7K 3 . 
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Since the blocklength used is arbitrarily long, the probability of decoding error at the relays can be made 
smaller than ejD for any e > 0. 

The relays in layer one treat these computed functions as their messages for the destination node, and 
re-encode them using again a computation code. In order to make this argument inductively, we will 
apply Theorem \T\ for each layer. Two difficulties arise. First, the statement in Theorem \T\ is only for the 
computation sum rate and it is not clear how much each individual transmitter and receiver contributes 
to this sum. For the induction argument, we need to argue that we can choose the message rates at the 
transmitters to be symmetric, and that we can choose the rates of the decoded functions at the receivers to 
be symmetric. Second, the definition of computation capacity stipulates only that the receivers decode an 
invertible deterministic function of the messages. In particular, the sum rate of the decoded functions at any 
receiver could be larger than the sum rate of the transmitted messages. For example, if a receiver decodes 
a sum over Z of two messages, then the entropy of this decoded function is larger than the entropy of 
either of the messages. For the induction argument, we need to argue that the we can choose the functions 
to be computed at the receivers to be over the same alphabet as the messages at the transmitters, thus 
avoiding growth of the messages as they traverse the network. 

From the proof of Theorem [Q we see that the rates of the messages at the transmitters as well as the 
rates of the computed functions at the receivers are indeed symmetric as the time expansion parameter 
L — > oo (see Sections [V-CI and [VI-CI ). Moreover, the messages at the transmitters as well as the computed 
functions at the receivers are all over the same finite field of size q (see again Sections I V-CI and I VI-CI) . 
Thus, the message sizes do not increase as they traverse the network. 

We can therefore inductively apply Theorem Q] to conclude that the relays at layer d in the network can 
decode a deterministic invertible function of the messages at layer d — 1 for all d e {1, . . . , D} at sum 
rate at least 

mog(P) - 7K 3 . 

Since the composition of invertible functions is invertible, this implies that the relay nodes in layer D 
compute a deterministic invertible function of the messages at the source at this sum rate. 

Since the relay nodes in the last layer are connected to the destination node by orthogonal bit pipes 
of infinite capacity, they can forward their computed message to the destination. The destination node, in 
turn, can then invert these K functions to recover the original messages. Since the probability of decoding 
error is at most e/D in each layer, this implies that the destination node decodes in error with probability 
at most e by the union bound. Since e > is arbitrary, this proves the lower bound in Theorem [2] ■ 

IX. Conclusions 

We have considered time-varying Gaussian relay networks consisting of K source nodes communicating 
to a destination node with the help of D layers of K relay nodes. We have presented a capacity 
approximation for this type of communication network. The gap in this approximation depends only 
on the number of source nodes K and the fading statistics, but is independent of the depth D of the 
network and the transmit power P. This contrasts with previously known approximation results, which 
have a gap that increases linearly with the depth D of the network. 

At the heart of our achievable scheme is the concept of computation alignment, combining computation 
codes with signal alignment. The use of computation codes allows the relay nodes to remove receiver 
noise, thus preventing noise from accumulating as messages traverse the network. The use of signal 
alignment allows the transformation of the wireless channel with time-varying complex-valued channel 
gains into subchannels with constant integer-valued channel gains, over which these computation codes 
can be used efficiently. 

Appendix A 
Computation Over Integer Channels 

The channel matching and precoding / matched filtering steps in Sections |Vl and IVTl transform the time- 
varying linear channel with arbitrary complex channel gains into a constant linear channel with integer 
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channel gains. In this section, we analyze how to reliably compute functions over this channel. We will 
employ the compute-and-forward scheme from [3], being well-suited for such constant linear channels 
with integer channel gains. 

Throughout this section, we consider the subchannels (1381) and (IFTT) . Specifically, relay m observes 



A 



^m,k^k 

[t]+fi m [t] (55) 



k=l 

where j3 > is a positive real scaling factor, a m ^ 6 Z are integer channel coefficients, s k [t] £ C are the 
symbols sent by transmitter k, and 

K 

Hm[t] = 22e m)k [t\s k [t] + 8 m [t] + z m [t] e C 

k=l 

is the sum of interference and noise terms. Part of the interference is due to residual channel fluctuations 
e m ,fc[i] and the remainder is due to leakage from other subchannels written as 9 m [t]. We assume that 

|em,fc[*]| < 7 2 

for all m,k, and for some finite constant 7 2 not depending on m,k,t. Finally, z m [t] is i.i.d. circularly 
symmetric Gaussian noise with mean zero and variance one. Each leakage term 9 m [t] has expected power 

2 



me m [t}\ 2 ) < 



a' 



and is independent of the symbols Sk[t] for all m, k, and t. Over a block of length T, we impose an 
average power constraint of 



T 

" < P. 



I 1 



T 

t=i 

It will be convenient to express the messages at the transmitters as well as the functions computed at 
the receivers in some finite fieldH To this end, we write the message u> fe at transmitter k as a vector w k 
of length k with components in {0, . . . , q — 1} for some prime number q. Receiver m aims to recover the 
function 

K 

u m = ^2 a m,kW k (mod q) 
k=i 

where a m fc are the same integer-valued coefficients that appear in (155T ). We will assume that these 
coefficients are chosen so that the resulting functions are invertible. Since we transmit K messages with 
alphabet size q K over T channel uses, the computation sum rate (in bits per channel use) is 

K^og(q). 

The following theorem, which is a special case of (31 Theorem 1], lower bounds the computation sum 
capacity of the channel (1551) . 



Lemma 5. The computation sum capacity of the channel (1551) is lower bounded by 

Klog(SINR) 

with 

SINR 



P 2 P 



l + a 2 + Kj 2 P' 



6 This property will be quite useful in the analysis of D-layer relay networks as it ensures that the rates of the recovered functions are the 
same as the transmitted messages. 
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We point out that the codebooks at the K transmitters in Lemma [5] are chosen independently of the 
coefficients a rrhk . In other words, the encoders are universal with respect to the channel and equation 
coefficients a m:k . 

Proof of Lemma \5} We provide an outline of the proof of Lemma [5] and refer interested readers to 
[fJl for a comprehensive treatment. 

Since the channel coefficients are integers, the real and imaginary parts of the inputs will not mix. 
Therefore, we can write the length T complex-valued channel input and output vectors as length 2T 
real-valued vectors, and treat the channel as essentially a real channel of blocklength 2T. 

To ensure that integer combinations of codewords are themselves codewords, each transmitter employs 
a nested lattice codebook [|43l . Formally, a lattice A C R 2T is a discrete subgroup of 2T-dimensional 
Euclidean space, and has therefore the property that Ai, A 2 G A implies Ai + A 2 G A. To construct a nested 
lattice codebook, we first generate two lattices Ap C R 2T and Ac C R 2T such that Ac C Ap c R 2T , i.e., 
Ac is a sublattice of Ap. We refer to Ac as the coarse lattice and to Ap as the fine lattice. The codewords 
are those elements of the fine lattice Ap that fall into the fundamental Voronoi region V of the coarse 
lattice Ac- 

All transmitters use the same 2T-dimensional nested lattice code. Each message vector w k is mapped 
into a lattice point A fc G Ap fl V. This mapping is one-to-one, i.e., to each A fc G Ap fl V corresponds a 
unique message vector w k . Moreover, this mapping can be chosen to preserve linearity (see (31 Lemma 5, 
Lemma 6]). 

To each encoded lattice point we add a random dither d k generated independently and uniformly over 
V to get the length 2T channel input 

Sk = A fc + d k (mod A c ) 

where (mod Ac) denotes ordinary addition in R 2T modulo the coarse lattice Ac- By Il43l Lemma 1], 
due to the random dither, the signal s k is independent of the lattice point X k . We make all of the dither 
vectors available to all relays as common randomness. 
The (scaled) channel output can be written as 

/rV 

Since all receivers are assumed to have access to the dithers, receiver m can remove them from its desired 
integer combination to form 

K K 

/3~V m - ^2 a m,kd k = ^2 a m,k^k + /3"Vm (mod A c ). 

k=l k=l 

Due to the dithering step, the vector \x m is the sum of terms that are all independent of the lattice points 
Afc. Hence, \x m can be treated as additive noise. Its expected power is upper bounded by 

^E(||/U 2 ) <l + a 2 + K^P. 
Following the arguments in [3, Theorem 1], all K relays can recover the lattice point 

K 

^a m ,fcAfc (mod A c ), 
fc=i 

and hence decode their equations 

K 

^2a mjk w k (modg), 
k=i 



K 
k=l 
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with vanishing probability of error if the rate if the lattice codebook satisfies 



1 1 / R 2 P \ 

-log|A F nv|<-log( ) 



2T °' 1 2 ° VI + a 2 + K^ 2 P, 

Since there are .K" receivers, and since one channel use of the original complex channel corresponds to 
two channel uses of the real channel, the computation sum rate is 

/ (3 2 P 

Finally, since this scheme works with expectation over the random dithers, there is (at least) one set of 
fixed dithers that can attain this performance, and therefore no common randomness is actually necessary. 

■ 

Appendix B 

Upper Bound on the Expected Value of \og(c(H)) 
In this section, we derive the upper bound 

3K 2 

lim E(log(c(H)); WH^ < oo) < — - 

as the quantization parameter v — > oo. 

The term c depends on the quantized channel gains H, and hence, implicitly, on the channel gains H 
and the quantization parameter v. With slight abuse of notation, we write 

c(H) = c(i2». 

We then have 

E(log(c(H)); IIHIU < oo) = ^ log(c(ff))p A (H) 

H:||H||oo<oo 

hiihii ^ ■/Hec?- 1 (H) 

n: || n || oo <oo 

log(c(H,i/))/ jr (H)dH 
E(log(c(ff, i/)); ||JET Hoc < v) 



H:||H|| 00 <i/ 



by Fubini's theorem, and where fn denotes the density of H and Q the operation of the quantizer. 
From the definition of c, and using ©, 

c(H, v) = JJmax{|/i m)fc |, \h m ^ 1 } 

m,k 

< 2 A ' 2 JJmax{|/i mifc |, |/i m ,fe| _1 } 

m,k 

for H such that ||i3"||oo < v. Hence, 

log(c(JET, i/))l{||fr|| 00 < i/} < K 2 + J2 l °s(^{\h m ,k\,\h m ,k\~ 1 })- 

m,k 

Since 

E(log (max{\h m>k \, |/i m , fc | -1 })) < oo 
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by assumption on the fading process, this implies that 

lim E(log(c(i2>)); WHl^ < u) = E( lim log(c(i3»)) 

by dominated convergence. Since H converges to H almost surely as v — )■ oo by the construction of the 
quantizer, this yields 

Jim E(log(c(if, i/)); H-EflU < f ) = J^E(log (max \\h m ^\, \h„ hk \~ 1 })) 

m.k 

K 2 

= — E(log(max{|/i lil | 2 ,|/i 1 , 1 |- 2 })). (56) 
It remains to upper bound the expectation over h\ t \. Since |/ii ; i| 2 has exponential distribution, we have 

/*1 poo 

E( log ( max { |/ii,i| 2 , \hi ,i| -2 })) = — / exp(— s) log(s)cis + / exp(— s) \og(s)ds 

Js=0 J s=l 

= ( 7 -2Ei(-l))log(e) 
< 1.5, 

where 7 is the Euler-Mascheroni constant. Combining this with (|56l) shows that 

IK 2 

lim E(log(c(if)); IIHIL < 00) = E( lim log(c(i2»)) < . 

V— >OD f— S-OO 4 
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